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Abstract. Given a real, symmetric matrix S, we define the slice T§ through 
S as being the connected component containing S of two orbits under conjuga- 
tion: the first by the orthogonal group, and the second by the upper triangular 
group. We describe some classical constructions in eigenvalue computations 
and integrable systems which keep slices invariant — their properties are clar- 
ified by the concept. We also parametrize the closure of a slice in terms of a 
convex polytopc. 



The basic definition 

Let S be a real n x n symmetric matrix with simple spectrum a(S) = {X± > 
. . . > A„}. When is a matrix simultaneously an orthogonal and an upper triangular 
conjugation of 5? More precisely, we consider the slice !Fs through S, defined to 
be the intersection of the sets {Q T SQ, for arbitrary real orthogonal matrices Q 
with det Q = 1} and {RSR^ 1 , for arbitrary real upper triangular matrices R with 
diagi? > 0}. 

How large is a slice? Clearly, if Q T SQ = RSR^ 1 then SQR = QRS, and 
hence, since S has simple spectrum, QR = f(S), for some (real) polynomial /. 
The matrices Q and R are uniquely determined from f(S) if f(S) is invertiblc: 
this is the standard QR- factorization of a matrix (0). For convenience, we write 
f(S) = QR=[f(S)] Q [f(S)] R . 

Slices have been appearing in disguised form in the literature of numerical analy- 
sis and integrable systems, and questions about the geometry of slices have come up 
intermittently. By putting these three aspects side by side, we expect to convince 
the reader that the concept is indeed a natural one. 

Slices and the computation of eigenvalues 

Francis, in his fundamental work on the QR algorithm (|(|) considered the fol- 
lowing map between matrices. Take S invertible symmetric, factor S = QR and 
define the QR step S' = RQ. It is clear that S' is symmetric with the same spec- 
trum of S, since S' = Q T SQ. But more is true: <S" = RSR^ 1 . In particular, as 
Francis had already pointed out, S and S' have the same bandwidth (the reader 
should compare this argument with the usual one [JL3|). Also, S' belongs to the slice 
through S: in the notation for the elements of Ts presented in the introduction, S' 
is associated to the function f(x) = x. 
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Numerical analysts also know well that = [S k ]^S[S k ~\ ^ equals the matrix 

obtained by applying k times the QR step starting from S: in other words, S^ k ' is 
the fc-th step of the QR iteration starting from S = S<-°\ Again, S<*) belongs to 



Ts, and is associated to f(x) — x k . 

Instead, by taking f(x) = x 1 ^, the resulting matrix S 1 -- 1 ^^ is, in a precise sense, 
the 1/fc-th QR step. Taking the limit (@) 

lim (S* 1 /*) - S^)k = hm S(e) - - 

yields a vector field S — X(S), whose solution S(t) starting at 5(0) = S at time 
fc equals S^ k ' ! The computation of this limit appears in a number of arguments in 
the subject. The expression for S^ e ' is 

S^=[sf Q S[S<] Q , where S*=[S%[S*] R . 

Evaluating the derivatives in e, we learn that 



(logS)S^ = (^] Q )[S^ R +[S%(^] R ). 
From the last equation, 



[s% (log s)s< [s^] - 1 = [S%(- [S*] q) + (— [S<] R ) [s<] ~\ 

which obtains 

t^ 6 ] Q^°S 5) q f*^ 6 ] R [5 C ] = log = \_S e ~^ [ l ^' e ] q) t 1 ^* 6 ] r) t*^ 6 ] fl^' 

Now evaluate the derivatives at e = 0: we must have I = S° = I.I = [S°] [S°] R 
and 5*^°^ = S. The equation above yields 

log5=^[5%| e=0 + ^[^] fl | e=0 . 

The two terms in the right hand side of the last equation are special matrices: 
they are respectively skew symmetric and upper triangular. Consider the (unique, 
linear) decomposition of a matrix M = II a M + II U M as a sum of a skew symmetric 
and an upper triangular matrix. Then, from the expression for the derivative of 
S^ e \ jfe [^ c ]q\(=o = n a log S. The vector field which interpolates the QR iteration 
then is X(S) = [S,n o log5]. 

The iterations and flows defined above lie in Ts- In particular, both preserve 
the eigenvalues of the initial condition, its symmetry and its bandwidth. 

For numerical analysts, the asymptotic behavior of the QR iteration, well known 
to Francis, is of capital importance. Say, for example, that J is an arbitrary Jacobi 
matrix (i.e., a real, tridiagonal matrix whose entries Jk,k+i — Jk+i,ki k = 1, . . . ,n — 
1 are strictly positive). Starting with J, the QR iteration converges to a diagonal 
matrix D. Not only D and J have the same spectrum but D must lie in the 
closure of Tj, the slice through J. Steps of QR type related to different functional 
parameters / give rise to iterations which, starting from J, always converge to 
diagonal matrices, with diagonal entries consisting of the (distinct) eigenvalues 
of J in an arbitrary order. The closure of the slice Tj clearly ought to contain 
additional points. In a nutshell, they correspond to reducible Jacobi matrices, i.e., 
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matrices for which some entry Jk,k+i = Jk+i,k is zero. This will be explained in 
the sequel. 

Slices and the Toda flows 

We now describe briefly the Toda lattice, which is an integrable system for which 
slices appear as natural phase spaces. Consider n particles on the line with positions 
Xk and velocities yk evolving under the Hamiltonian 

1 n n— 1 

k=l k=l 

This Hamiltonian was introduced as a model of wave propagation in one di- 
mensional crystals ([[l6)). It was Flaschka's remarkable discovery (||) that this 
dynamical system is equivalent to the matrix differential equation 

j={J,U a J], 

where J is the Jacobi matrix with nonzero entries 

Jk,k = -yjk/2 and J Kk+ i = Jk+i,k = e («»-«*+0/ 2 /2. 

The fact that the differential equation is in the so called Lax pair form (|))) 
implies that the evolution J(t) is actually an orthogonal conjugation of the initial 
condition J(0). But again more is true: the evolution stays within the set of 
Jacobi matrices, as we should expect, since there are no other physically significant 
variables in the problem (velocities essentially are the diagonal entries, and distances 
between particles with neighboring indices correspond to the off-diagonal entries). 
Indeed, it is not hard to check that 

J(t) = [e tJ %J(0)[e^] Q = [e tJ ^] R J(0)[e tJ M]-\ 

the celebrated solution of the Toda lattice by factorization ([0). To check this 
formula, proceed as in the computation of the limit of the previous section. Thus, 
again, the orbit J(t) lies within the slice through J(0). 

Moser also computed the asymptotic behavior of the Toda lattice for 
f(x) = x, the orbit J(t) starting from a Jacobi matrix J(0) converges to a diagonal 
matrix, with eigenvalues disposed in decreasing order. This is in accordance with 
the relationship between Toda and QR and has a natural physical interpretation. 
Diagonal entries are velocities: in the long run, particles move apart and tend to 
undergo uniform motion, each with speed given by a different eigenvalue. Faster 
particles move ahead, explaining the orderd outcome of the eigenvalues along the 
diagonal found in applications of the QR iteration. The remarkable fact that as- 
ymptotic speeds both at — oo and +oo are the same will not be relevant to us: 
this is strong evidence of the integrability of the system — each orbit 'remembers' 
this data. Other matrices belonging to the boundary of the slice through J(0) 
correspond to asymptotic behavior in which the system of particles breaks into 
essentially disconnected components, the so called clustering. 

For a symmetric matrix S and an arbitrary function g, the matrix equation 
S = [S, H a g{S)] admits a similar solution by factorization: conjugate the initial 
condition by [e tff( -' s< - ' ) - ) ] ^. It is this last fact which historically was responsible 
for relating the QR iteration and Toda flows (|jl5), H): when g(x) — log(x), the 
resulting differential equation gives rise to the orbits interpolating the QR iteration. 
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For applications of these differential equations to numerical analysis, the reader may 
consult 0. 

Geometric aspects of slices 

Detailed study of variables for which the Toda flow becomes especially simple 
led Moser (|fl]]) to a parametrization of Jacobi matrices. It turns out that Jacobi 
matrices have simple spectrum and its eigenvectors always have nonzero first co- 
ordinates — in particular, they can be normalized so as to have (strictly) positive 
first coordinates. Moser proved that the map taking Jacobi matrices to n-uples of 
distinct real numbers (the eigenvalues) and a point in the first octant of the unit 
sphere in R™ (the first coordinates of the normalized eigenvectors) is a diffeomor- 
phism. He then showed that an appropriate choice of Toda flow (i.e., of functional 
parameter g) gives rise to orbits joining any two Jacobi matrices with the same 
spectrum. Thus, given a Jacobi matrix J, Tj is the set of all Jacobi matrices with 
the same spectrum as J. (We remind the reader that Jacobi matrices always have 
simple spectrum.) From Moser's parametrization, Jacobi slices are diffeomorphic 
to M™" 1 . 

More generally, getting back to the definition of slices, we have seen that a 
matrix S' E F s is of the form S' = Q T SQ = RSR~\ where f(S) = QR for 
some function /. Keeping account of the requests on Q and _R, one may obtain a 
simple coordinatization of a slice. A matrix S is irreducible if S has no invariant 
subspace generated by a subset of the canonical vectors e%,. .. , e„. Indeed, there 
is a diffeomorphism between positive polynomials up to scalar multiplication and 
elements of Ts , for an irreducible symmetric matrix ( [jlO| ) . 

Which matrices are in the boundary dJ-j of the slice through a Jacobi matrix J? 
Numerical analysts knew that the diagonal matrix A = (Ai, . . . , A n ) and its per- 
muted counterparts A T = (X n m, ■ . . , A^)). As we shall see, the simplest possible 
way of combining such n\ points to form a reasonable boundary turns out to be the 
topological description of dJ-j. Let Vk be the permutohedron, obtained by taking 
the convex closure of the points of the form 6 K". Notice that actually Vk has 
an interior of dimension n — 1, since the sum of the coordinates of any point in V\ 
equals the trace of J. The permutohedron Va is homeomorphic to the closure of 
J-j : as was proved in Jl8| with combinatorial arguments. 

The figure below describes the situation for a 3 x 3 Jacobi matrix J with spectrum 
er(J) = {4,2,1}. On the left is a topological representation of the closure of the 
slice J- j. Interior points of the hexagon are points in Tj. The boundary consists of 
two kinds of points: the six diagonal matrices, corresponding to the vertices, and 
the matrices which form the edges, which have either entry (1, 2) or (2, 3) equal 
to zero. On the right, the permutohedron associated to a(L) is projected on the 
(x, y) plane. Notice that a map taking vertices to vertices, say, taking diag (1, 2, 4) 
to (1,2,4) and diag (2, 1,4) to (2,1,4) may not behave so naively on all vertices, 
for continuity reasons. 

The permutohedron associated to a Jacobi 4x4 matrices is also drawn below. 
Faces have a clear meaning: there must be four hexagons associated to matrices for 
which entry (1,2) equals zero (and hence contained a Jacobi 3x3 block of fixed 
spectrum), six quadrilaterals corresponding to matrices with entry (2,3) equal to 
zero, and four more hexagons for matrices with entry (3, 4) equal to zero. There 
are also 4! vertices. 
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Figure 1. Slice and permutohedron for a (J) = {4, 2, 1} 
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FIGURE 2. The permutohedron for a (J) = {4, 3, 2, 1} 



The permutohedron is familiar to spectral theorists, from the well known Schur- 
Horn theorem ( |]l2| ). Let S be a symmetric matrix with simple spectrum a(S) = 
{Ai, . . . , A n }, and consider the set Os = {Q T SQ, Q orthogonal }. Then the map 
S i— ► diag S is surjective from Os to Va- Such a map is highly not injective. Slices 
are in a sense minimal sets in Os in bijection with permutohcdra. 

The presence of a convex polytope raised the possibility of relating slices to 
moment maps of symplectic toroidal actions (fjj,||). Indeed, following this route, 
Bloch Flaschka and Ratiu obtained an explicit diffeomorphism between the closure 
of J- j and V\, which we now describe. Let S be a symmetric matrix with simple 
spectrum, and consider the spectral decomposition S — Q T AQ, where, as usual, Q 
is orthogonal and A is diagonal. Now, order the diagonal entries of A in descending 
order: Q is then defined up to a choice of sign for each column of Q T (since they 
are normalized eigenvectors of S). Thus, S — Q T DADQ, for some diagonal matrix 
D of signs. The matrix DQAQ T D is dependent on D, but its diagonal is not! The 
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map S i— > diag (QAQ T ) is the BFR map. Its restriction to the closure of a Jacobi 
slice Tj is the required diffeomorphism to Pa (@)- 

The proof of the result above makes use of sophisticated machinery. However, 
once we knew what had to be proved, a simpler argument appeared, yielding a more 
general result (fnj). Let S be an arbitrary real, symmetric, irreducible matrix with 
simple spectrum. Call the diagonal matrices in the closure of Ts accessible, and 
their images under the BFR map the accessible vertices. Now let Vs, the spectral 
polytope of S, be the convex closure of the accessible vertices of S. 



Theorem: The BFR map is a diffeomorphism between the closure of the slice 
Ts and the spectral polytope Vs- 



Generically, the spectral polytope of a symmetric matrix with simple spectrum 
is indeed the permutohedron associated to the n\ diagonal arrangements of the 
eigenvalues. But this is not the case in general. A self-contained description of the 
spectral polytope of an irreducible matrix S is as follows. Consider the spectral 
decomposition, 5 = Q T AQ. We denote by Q{ ri ,...,r k },{ci,...,ck} the minor of Q 
consisting of the entries in the intersection of rows with indices r\ , . . . , rj, with 
columns with indices in c\, . . . , Ck- For a permutation pi in n symbols, let n denote 
the permutation matrix with entries ITj — 

Theorem: A diagonal matrix A w = LT T An is an accessible vertex of the slice Ts if 
and only if the minors Q/^i)},^}, Q{ 7 r(i), 7 r(2)},{i,2} ! ■ ■ ■ , Q{-k{i),... ,7r(n)},{i,... ,«} have 
nonzero determinant. 



As an example, let S = Q T AQ be a 3x3 matrix with eigenvalues A = diag (4, 2, 1), 
and so that the entry Q\,\ equals zero, but no other minor of Q has determinat equal 
to zero. The slice Ts and the spectral polytope have only four vertices: diagonal 
matrices with entry (1,1) equal to 4 do not belong to Ts-, and the vertices of the 
spectral polytope are (2, 1, 4), (2, 4, 1), (1, 2, 4) and (1, 4, 2) — Ts is a quadrilateral. 
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